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Abstract 

We investigate the representation of symmetric polynomials as a sum of squares. 
Since this task is solved using semidefinite programming tools we explore the geo¬ 
metric, algebraic, and computational implications of the presence of discrete sym¬ 
metries in semidefinite programs. It is shown that symmetry exploitation allows 
a significant reduction in both matrix size and number of decision variables. This 
result is applied to semidefinite programs arising from the computation of sum 
of squares decompositions for multivariate polynomials. The results, reinterpreted 
from an invariant-theoretic viewpoint, provide a novel representation of a class of 
nonnegative symmetric polynomials. The main theorem states that an invariant 
sum of squares polynomial is a sum of inner products of pairs of matrices, whose 
entries are invariant polynomials. In these pairs, one of the matrices is computed 
based on the real irreducible representations of the group, and the other is a sum 
of squares matrix. The reduction techniques enable the numerical solution of large- 
scale instances, otherwise computationally infeasible to solve. 
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1 Introduction and preliminaries 


A fundamental problem in real algebraic geometry is the existence and computation of a 
representation of a multivariate polynomial as a sum of squares (SOS). In other words, the 
question of hnding pi G M[x], i = 1,..., iV such that 

N 

/w = • 

i=l 

Besides its obvious theoretical interest, this is also a relevant question in many areas of 
applied mathematics, such as continuous and combinatorial optimization, as well as real 
polynomial system solving via the connections with the Positivstellensatz [20,21,33]. While 
algebraic techniques have been proposed to decide the problem [23], recent results [28,20,22] 
suggest that this problem can be solved much more efficiently using numerical optimization 
techniques such as semidefinite programming (SDP) [34]. 

It is a fact that many problems in applied mathematics and engineering possess inherent 
symmetries in their mathematical formulations. While in some cases this mirrors the un¬ 
derlying structure of existing physical systems, these features can also arise as a result of 
the chosen mathematical abstraction. A natural question, therefore, is the investigation of 
the consequences of these symmetries in the efficiency and performance of the algorithms 
associated with the computation of diverse quantities of interest. 

Motivated by these reasons, we study the problem of hnding sum of squares decompositions 
whenever the given polynomial / is invariant under the action of a finite group (discrete 
symmetries). Our initial focus will be on the consequences of symmetry in general semideh- 
nite programs, emphasizing later the specihc class that appears in the computation of sum 
of squares decompositions and the associated Positivstellensatz relaxations. 

In general, symmetry reduction techniques have been explored in several different contexts. A 
partial list of relevant examples and application areas are dynamical systems and bifurcation 
theory [12], polynomial system solving [35,10], numerical treatment of partial differential 
equations [7], and Lie symmetry analysis in geometric mechanics [18]. Despite belonging to 
very different domains, common themes in all these developments are the conceptual and 
computational benehts of a thorough understanding of the constraints imposed by symmetry. 

The general advantages of symmetry exploitation are numerous: 

Problem size. The hrst immediate advantage is the reduction in problem size, as the new 
instance can have a signihcantly smaller number of variables and constraints. This notably 
affects the total running time. A well-known application area where these techniques are 
employed is hnite-element methods for partial differential equations [7]. 

Degeneracy removal. A very common source of difficulties in numerical methods is the 
presence of repeated eigenvalues of high multiplicity. These usually arise from structural 
properties, and can be removed by a proper handling of the symmetry. 
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Conditioning. For the above mentioned reasons, the resulting symmetry-aware method¬ 
ologies in general have much better numerical conditioning. Additionally, the elimination 
of non-essential variables allows for faster and more accurate solutions. 

Reliability. Smaller size instances are usually less prone to numerical errors. For instance, 
numerical path following is much more accurate in a hxed point space of smaller dimension 
than in the original high dimensional space. 


In later sections, we illustrate the concrete form that these general benehts take in the 
problem of computation of sum of squares decompositions. 


For the exploitation of symmetry of hnite groups there are two different, but complementary, 
approaches. The first one is the theory of linear representations^ as presented in [7,27], and 
the second one being invariant theory [31,6]. We make use of both approaches in our work, 
and show how the results can be naturally interpreted in either framework. 


A particularly interesting application of the sum of squares (SOS) methods, outlined in 
Section 2, is the optimization of symmetric polynomials. In [20,22], and based on [28], an 
SDP-based algorithm for the computation of a global lower bound for polynomial functions is 
described. When the polynomial to be minimized has certain symmetry properties, then the 
theory described here can be fruitfully applied. In related work, Choi et al. in [4] have shown 
how symmetry reduction techniques can signihcantly reduce the complexity of analyzing 
SDP/SOS conditions. 


The outline of the paper is as follows: in Section 2 we start by recalling the intimate con¬ 
nections between sum of squares and semidehnite programs, as well as their applications in 
optimization. In Section 3 the concept of an invariant semidefinite program is introduced. 
The key result in this section relies critically on the convexity property, and shows that 
the feasible set can always be restricted to a specihc invariant subspace, thus reducing the 
problem to a semidehnite program of smaller dimensions. In Section 4, linear representation 
theory is employed to hnd a change of coordinates where all the matrices in the SDP have 
block diagonal form. The SDP therefore collapses into a collection of coupled smaller opti¬ 
mization problems, which are much easier to solve. In the following section, we concentrate on 
the sum of squares techniques for symmetric polynomials, where the group representation 
on matrices is induced by an action on the space of monomials. We show that the com¬ 
plexity of checking positive dehniteness conditions simplihes dramatically, and illustrate the 
techniques with numerous examples. Examples from the literature [1,26,22,20,29] have been 
computed with SOSTOOLS [24]. An invariant-theoretic viewpoint is introduced in Section 6, 
stressing its computational efficiency and the natural connections with the representation- 
based approach developed earlier. As part of the developments, we obtain a novel and very 
appealing representation of symmetric SOS polynomials as a sum of inner products between 
invariant matrices, one of them being an SOS matrix (Theorem 6.2). Finally, in Section 8, 
the computational savings of our techniques are illustrated in some large-scale problems. 
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2 Optimization of symmetric polynomials and sum of squares decompositions 


It is well-known that the problem of global minimization for multivariate polynomials is an 
NP-hard problem. The technique we describe, which was first proposed by Shor [28], is a 
relaxation: it produces a certihable lower bound for the optimal value, and in practice very 
often provides the correct answer. We provide a brief outline below. 

Consider a given polynomial f{xi, . . . ,Xn) of even total degree 2d. If we can hud a real 
number A and polynomials Pi G M[a;i,..., a;„], i = 1,..., iV such that: 

N 

/(x) - A = (1) 

i=l 

then clearly A is a global lower bound of /(x), i.e., /(x) > A for all x G M”. As shown in [28], 
such a decomposition, if it exists, can be found using convex optimization, and in particular, 
semidehnite programming (see [22,19,17] and the references therein). 

We illustrate the idea in the general case, i.e., when the polynomial / is dense. We write 
/(x) = Y"’"AY, where A is a real symmetric matrix and Y denotes the column vector whose 
entries are all the monomials in xi,..., Xn of degree at most d. The length of the vector Y 
is equal to the binomial coefficient 


N = 

y d 

However, the matrix A in this quadratic representation is not necessarily unique. Let 
denote the vector space of real symmetric N x Wmatrices. Then 

Cf = {AeS^\ /(x) = Y^AY}. 

denotes the set of all real symmetric N x Wmatrices A for which the representation is valid. 
This is an affine subspace in the space of real symmetric N x Wmatrices. Assume that the 
constant monomial 1 is the first entry of Y. Let Eu denote the matrix unit whose only non¬ 
zero entry is a one in the upper left corner. Then Cf-\ = {X G \ /(x) — A = Y^QY} 
where Q = A — X ■ Eu. 

Lemma 2.1 ([22]) Let /(x) G M[x]. For any real number \, the following statements are 
equivalent: 

(i) The polynomial /(x) — A has a representation as a sum of squares in M[x]. 

( a ) There is a matrix Q G /^/-a sueh that Q is positive semidefinite, that is, all eigenvalues 
of this symmetric matrix are nonneqative reals. 

(Hi) There is a matrix A E Cf sueh that A — X ■ Eu is positive semidefinite. 

Proof: The matrix A — X ■ Eu is positive semidefinite if and only if there exists a real 
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factorization A — X ■ En = ■ B. If this holds then 

/(x) -\ = Y^-A-Y-\ = Y^-{A-\- Eu) ■ Y 
= Y^ ■ B^B • F = {BYf ■ {BY) = 

is a sum of squares, and every representation as sum of squares arises in this way. □ 

As we shall see next, Lemma 2.1 implies that we can hnd representations of a polynomial 
as a sum of squares by solving a particular kind of convex optimization problems called 
semidefinite programs. 

The standard semidehnite program [34] is given in the following way. The inner product on 
is denoted by (•, •) and let C denote the set of positive semidehnite matrices. It 
is well-known that the set forms a convex cone. Given a symmetric matrix C & we 
dehne a linear cost function E : M by E{X) = {C,X). Given an affine subspace C of 

, the convex optimization problem is: 

E := min (G,X), 
x&cns^ 

where we minimize the linear cost function E{X) = {C,X), over the set of feasible matrices 
which is the intersection of an affine matrix subspace C with the set of positive semidef¬ 
inite matrices. This is a hnite dimensional convex optimization problem, for which efficient 
algorithms are available [34]. 

For the case of sum of squares the linear subspace of symmetric matrices representing f{x) — X 
is C = {X G I 3 A e M with f{x) — X = Y'^XY} while the cost function is E{X) = (G, X) 
with C = Ell. We write for the largest real number A for which the minimum of E{X) 
is attained. is a lower bound for the global minimum f* of / on This inequality may 
be strict. It is even possible that = —cx) for the case that no A exists such that f{x) — A 
has a representation as sum of squares. We refer the reader to [22] for more background, 
numerical results, and comparison with other methodologies. 

Sum of squares matrices: We introduce now the notion of sum of squares matrices. These 
will be matrices, with polynomial entries, that are “obviously” positive semidehnite for all 
values of the indeterminates. Therefore, this provides a natural generalization of the sum of 
squares condition as an easily verihable certihcate for polynomial nonnegativity. 

Definition 2.2 Let S G R[x]™^™' be a symmetric matrix, and y = [yi,... he new 
indeterminates. The matrix S is a sum of squares (SOS) matrix if the scalar polynomial 
y^Sy is a sum of squares in M[x, y]. 

It is clear that this concept reduces to the standard SOS notion for 1x1 matrices, i.e., scalar 
polynomials. 
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Example 2.3 Consider the matrix S G R[a;], given by: 


S = 


— 2 x + 2 X 


X 


X 


This is a SOS matrix, sinee 


y^Sy = 


p _ 

T 

p _ 


p -l 

yi 


2-101 


yi 

xyi 


-1 10 0 


xyi 

y2 


0 0 0 0 


y2 

xy 2 _ 


1 0 0 1 


xy 2 _ 


= {yi + xy 2 f + {xyi -yif 


0 

An important result about SOS matrices is the fact that S G (notice n = 1 here) is 

positive semidehnite for all x if and only if it is a SOS matrix. This is a direct restatement 
of the fact that (2,n;m, 2) positive semidehnite biforms are SOS; see [3] and the references 
therein. 

Since the SOS matrix condition is equivalent to a scalar SOS test in a polynomial with 
more variables, it is obvious that this can also be computed using semidehnite programming 
techniques. Furthermore, because of the special bipartite structure, only monomials of the 
form x^yj will appear in the vector Y, allowing for a more efficient implementation. 

As mentioned, in many practical applications the polynomial / has symmetry properties, 
such as being invariant under a group of transformations. 

Example 2.4 Consider the quartic trivariate polynomial analyzed in [22]: 

f{x, y,z) = X* + y* + z'^ — A xyz + x + y + z. 

This polynomial is invariant under all permutations of {x,y,z} (the full symmetrie group 
S 3 ). The optimal value is f* = —2.112913882, aehieved at the orbit of global minimizers: 

(0.988, -1.102, -1.102), (-1.102, 0.988, -1.102), (-1.102, -1.102, 0.988). 

For this polynomial, it holds that = f* ■ <) 

The symmetry appearing in polynomials (such as / above) suggest that it may be possible 
to customize the general SDP-based methodology to obtain more efficient algorithms. We 
present next our results for a new class of symmetric semidehnite programs, specializing them 
later to those that arise in the computation of sum of squares decompositions of symmetric 
polynomials. 
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3 Invariant semidefinite programs 


In order to introduce our techniques for the computation of sum of squares decompositions 
for invariant polynomials we start by analyzing the consequences of symmetry in arbitrary 
semidehnite programs. We consider hrst the most general cases, restricting in later sections 
to the special symmetries that arise in our specihc class of problems. The major result is 
that the symmetric semidehnite optimization problem can always be reduced to an easier, 
smaller problem because of its convexity^ see Theorem 3.3 below. 

As mentioned in the previous section, a semidehnite program is dehned as follows: given an 
affine subspace C of and a symmetric matrix C G we have 

F:= min (C',X), (2) 

where we minimize the linear cost function F{X) = {C,X), over the set of feasible matrices 
C n . Again C denotes the space of symmetric N x N real matrices, equipped 

with the inner product (•, ■) ; xS^ M, and C be the cone of positive semidehnite 
matrices. 

In several application domains, the SDPs to be solved involve discrete symmetries. In other 
words, there exists a hnite group G, and an associated linear representation a : G ^ Ant (5'^) 
for which certain relations hold. The hrst requirement is that maps onto itself under the 
group action, i.e., 

a{g){S^)CS^, WgeG. (3) 

In subsequent sections we will require further specihc properties for the representation a, 
such as being induced by a representation p : G ^ GL(M.^), via 

(^{9){X) := p{gfXp{g), for X G c/ G G, (4) 

as this will be the case in most practical instances. Obviously, the required property (3) is 
satished for such induced actions. However, the results in the present section are valid for 
arbitrary linear representations a, not necessarily of the form (4) above. 

In order to describe and investigate our invariant semidehnite programs we need a few 
more dehnitions. A subset C C is called invariant with respect to a if X G £ implies 
(j{g){X) E C, \/g E G. We already required the property that the cone of positive dehnite 
matrices is invariant. For a semidehnite program (2) to be invariant we will demand that 
the set of feasible matrices be invariant. In other words, if a point belongs to the feasible set, 
so do its images under the group action: 

XG£n5^ ^ a{g){X) E CnS^ ygEG. 

A linear mapping F : M is called invariant with respect to a if for all X G we 

have F{X) = F {a{g){X)) for all g E G. For the cost function we may restrict this property 
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to the feasible set 


{C,X) = {c,a{g){X)), ^g e G, yXeCnS^. 

Equivalently, all points in a group orbit Ox = {X G \ 3g E G with a{g){X) = Y} have 
the same cost. 

Definition 3.1 Given a finite group G, and assoeiated linear representation a : G ^ 
Aut{S^), a semidefinite optimization problem of the form (2) is ealled invariant with re- 
speet to a, if the following conditions are satisfied. 

(i) The set of feasible matrices C fl is invariant with respect to a. 

(a) The cost function F{X) = {G,X) is invariant with respect to a. 

A related class of SDPs with symmetry has been dehned independently by Kanno et al. in 
[15], where the authors study the invariance properties of the search directions of primal-dual 
interior point methods. Our results on the computational simplications that are possible for 
these problems, however, are new as far as we can tell. 

The group invariance of both the feasible set and the objective function suggests the possi¬ 
bility of considering a new optimization problem. Concretely, for the invariant semidehnite 
program dehned in Dehnition 3.1, an eguivalent optimization problem with a smaller number 
of variables and the same optimal solution can be obtained. The result is formally stated 
in Theorem 3.3 below. To this end, in addition to invariant sets we also dehne invariant 
matrices as those that are hxed under the group action, i.e., they satisfy: 

X = a{g){X), WgeG. 

Remark 3.2 The linear representations a we are interested in are in general orthogonal, 
i.e. the operators a{g) G Aut{S^) satisfy that their inverse is egual to the adjoint map, for 
all g E G. Then the matrix G defining a a-invariant linear cost function F{X) = {G,X) is 
itself an invariant matrix since 

(C, X) = (c, a{g)x) = {a{gYG, x) = {a{g-^)G, x) for all XESfigEG. 


We dehne the fixed-point subspace of iS" (also called G-stable or invariant) as the subspace 
of all invariant matrices: 


F:={XES-\X = aig)iX) ^gEG}, 

and the associated semidehnite program: 

F^-.= min (C,X), 
x&rncnS2 ' 


( 5 ) 

( 6 ) 


obtained by restricting the feasible set of (2) to the hxed-point subspace T. 



Theorem 3.3 Given an orthogonal linear representation a : G ^ Aut(iS"') of a finite group 
G consider a semidefinite program which is invariant with respect to a. Then the optimal val¬ 
ues F, Ffj of the original semidefinite program (2) and the fixed-point restricted semidefinite 
program (6) are equal. 

Proof: It is clear that F < F^, since every feasible matrix of (6) is also a feasible matrix 
for (2). For the other direction, given any e > 0, consider a feasible matrix Xi for (2), such 
that {C,Xi) = F -\- e. We will show that there exists a G JF fl £ fl iS" with the same 

cost {C, Xfi) = F + e. This implies F^j < F + e. Dehne the “group average” (or Reynolds 
operator): 

1^1 g&G 

We show hrst that Xo- is a feasible point for (6). Since Xi G £ fl 5" also a{g){Xi) G £ fl iS" 
by condition (i). First this means a{g){Xi) G £ and thus Xa- G £ because £ is a linear space. 
Secondly, (7{g){Xi) is positive dehnite. Since the positive matrices form a convex cone each 
convex combination such as the group average dehnes another positive dehnite matrix. Thus 
X(j G £ n S^. Also, by construction, X^ G T. Finally, the cost function has the same value 
for X\ and X^ by condition (ii): 

) = AE(C.Vi> = (C,Vi>=F + £, 

1^1 c/eG 

•so Ffj < F F e., from where the result follows. □ 

The crucial property that will enable an efficient use of symmetry reduction in semidehnite 
programs is their convexity. For general non-convex optimization problems, symmetries imply 
the existence of an orbit of minimizers (so there are still numerical advantages, see [35]), 
although the solutions themselves do not necessarily have any symmetric structure. When 
convexity is present, as we have seen, the situation is much better, and the search for solutions 
can always be restricted a priori to the hxed-point subspace. 

Example 3.4 The following very simple example illustrates the ideas. Gonsider a semidefi¬ 
nite program for n = 2 with C = and write X = {Xifi for positive semidefinite matrices. 
Then 

min (C, X) = min (Xu + X 22 ), where C = 
xe52 x&sl 

This semidefinite program is invariant under the group C 2 = {id, s} acting as a : C 2 
Ant (5^) with 



Xn 

X 12 


X 22 —X 12 

a(s) 

_Xi2 

X22_ 


1 

1 

bO 

1_ 


The fixed point subspace is T = {X G | X 22 = Xii,Xi 2 = 0}. By Theorem 3.3 the 


1 0 
0 1 


1^1 g&G 
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original problem is equivalent to the trivial linear program: 


min (C, X) = min (2Xii 


❖ 

For this particular example, the same conclusion could obviously be obtained by using el¬ 
ementary arguments. As we will see next, these basic techniques, in combination with the 
algebraic properties of the group representations that usually arise in practice, can be suc¬ 
cessfully applied to much more complicated examples. 


4 Induced actions and block diagonalization 


In most practical cases, the representation a of G on has a quite special form, since it 
arises as a consequence of a group action of G on This additional structure enables a 
computationally convenient characterization of the hxed point subspace. Essentially, under a 
convenient change of coordinates, every invariant matrix will be block diagonal. The details 
are provided below. 

The semidehnite programs we investigate in this section are invariant with respect to a linear 
representation a : G ^ Ant(5^) which is indueed by an action p : G ^ GL{M.^), via: 

(^i9){X) := p{gfXp{g), MgeG. (7) 

We assume that the representation p is orthogonal, that is, p{gYp{g) = p{g)p{g)'^ = In, for 
all g & G. Then, the restriction to the hxed point subspace (5) takes the form 

p{g)X-Xp{g) = 0, WgeG. (8) 

Matrices commuting with the action of a group as in (8) occur in many situations in mathe¬ 
matics and applications. Using the Schur lemma from representation theory, it can be shown 
that all such matrices can be put into a block diagonal form by a linear change of coordinates. 
In the following we briehy recall this well-known result and discuss its consequences for our 
invariant semidehnite programs. 

In what follows, we consider hrst complex representations, as the theory is considerably sim¬ 
pler in this case. For a hnite group G there are only hnitely many inequivalent irreducible 
representations di,..., of dimensions (or degree) ni,..., Uh, respectively. The degrees n* 
divide the group order |G|, and satisfy the constraint Yli=i = |G|- Every linear representa¬ 
tion of G has a canonical decomposition [27,7] as a direct sum of irreducible representations, 
i.e.; 

p = midi © m2'd2 © • • • © mhldh, 
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where mi,, mn are the multiplicities. According to this there is an isotypic decomposition 


C- = Vi®---®Vh, V, = Va®---® V,n.. 


A basis of this decomposition transforming with respect to the matrices ^i{g) is called sym¬ 
metry adapted and can be computed using the algorithm presented in [27,7]. This basis 
dehnes a change of coordinates by a matrix T collecting the basis as columns. By Schur’s 
lemma T~^XT has block diagonal form with one block Xi for each irreducible representation 
of dimension miUi which further decomposes into n* equal blocks Bi of dimension mi. that 
is: 


T-^XT = 

Xi 

0 

, X, = 

B, 

0 


0 

Xh 


0 



For our applications of semidehnite programs, the problems are usually presented in terms 
of real matrices, and therefore we would like to use real coordinate transformations. In fact a 
generalization of the classical theory to the real case is presented in [27, p. 106-109], and has 
been used many times in bifurcation theory [12]). If all ^i{g) are real matrices the irreducible 
representation is called absolutely irreducible. Otherwise, for each dj with complex character 
its complex conjugate will also appear in the canonical decomposition. Since p is real both 
will have the same multiplicity and real bases of V) +17 can be constructed. So two complex 
conjugate irreducible representations form one real irreducible representation of complex type. 
There is a third case, real irreducible representations of quaternonian type, seldom seen in 
practical examples. A real irreducible representation of quaternonian type decomposes over 
the complex numbers as twice a complex irreducible representation with real character, but 
complex representation matrices. 

Since p is assumed to be orthogonal the matrix T can also be chosen to be orthogonal, too. 
Thus symmetric matrices A are transformed to symmetric matrices AT. For symmetric 
matrices the block corresponding to a representation of complex type or quaternonian type 
simplihes to a collection of equal subblocks. 


The change of coordinates affects both the explicit description of the affine linear space C 
and the cost function F{X) = {C,X). 

Theorem 4.1 Let G be a finite group with h real irreducible representations. Let a be a 
linear representation of G on induced as in (1) by an orthogonal linear representation p 
on . Given a semidefinite program which is invariant with respect to a then there exists 
an explicitly computable coordinate transformation under which all invariant matrices are 
block diagonal. As a consequence, the semidefinite program can be simplified to 


h 


min 

X&C,Xi&Sfi 


Y^nfiG^Xfi, 


(9) 
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where rrii are the multiplieities in p and Ui the dimensions of the real irredueible representa¬ 
tions. Ci denote the blocks of C as well as Xi denote the blocks of X in the new coordinates, 
respectively. 

The semidefinite program in Theorem 4.1 is simply the hxed-point restricted semidehnite 
program (6) rewritten in a convenient coordinate frame in which the test of positive deh- 
niteness can be done much more efficiently. 

Remark 4.2 In solving the semidefinite programs, we only need to consider one block from 
each isotypic component. In other words, instead of a large SDP of dimension 
we solve instead h coupled semidefinite programs of dimension . Note also that some, or 
many, of the multiplicities mi can be zero. 

Example 4.3 This example illustrates both procedures (Theorems 3.3 and 4.1). Consider 
the SDP with problem data given by: 



0 0 0 


f 

abb 


c = 

0 1 0 

and C = < 

X E 3 a, 6, Cl, C 2 ,d eM. with X = 

b Cl d 



0 0 1 


< 

b d C 2 

> 


Then the semidefinite program 


min iC,X)= min (ci + C 2 ) (10) 

is invariant under the action of the group S 2 given by simultaneous permutation of the last 
two rows and columns. To restrict the problem to the fixed point subspace, we can impose the 
constraint ci = C 2 = c, obtaining: 


(11) 


S 2 has two irreducible representations of dimension ni = n 2 = 1 which are also real irre¬ 
ducible. The multiplicities in the current representation are mi = 2, m 2 = 1. A symmetry 
adapted orthogonal coordinate transformation is given by the matrix 


min (2c) where IF = {X E I 3 a, 6, c, d G M with X = 
x&j^ncnsl 


abb 
bed 
b d c 


T = 


1 0 0 

0 q ; a t 

0 a —a 


1 



Now, the block diagonalization procedure can be applied, and the constraints are simplified 
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to: 


T^CT = 


XT = 


Cl 0 
0 ^2 

Xi 0 ^ 

0 X 2 


with Cl = 


0 0 
0 1 


and C 2 = 1 and 


with Xi = 


a V2b 
y/2b c + d 


and Xo = c — d. 


By Theorem 4-1 the optimization problems (10) and (11) are equivalent to 


min ((Ci,Xi) + (C 2 ,X 2 )) = min ({c + d) + {c-d)) 
Xi&sl,X2>o^ > / \ XiScSj, X2>o'^^ ^ ^ C 


= min (2c). 
Xi&Sl,X2>0 


( 12 ) 

<> 


5 Sum of squares for invariant polynomials 


The third level of specialization occurs when the semidehnite programs to be solved corre¬ 
spond to the computation of a sum of squares decomposition of multivariate polynomials 
[20]. In this case, certain aspects of algebraic invariant theory [30,31,6] appear very naturally 
in the picture. 

Assume we are interested in finding a sum of squares decomposition of a polynomial /(x) of 
degree 2d in n variables which is invariant with respect to a linear representation d : G — 
GL{W^), i.e., /(x) = f {'0{g)x.), ^g E C. The set of all such invariant polynomials is the 
invariant ring R[x]*^. The representation d on induces another linear representation r of 
G on the polynomial ring R[x] by T{g)(^p{x.)^ = p(^'0{g~^)x^. In this section, we study how 
this structure can be exploited computationally. 

Example 5.1 On a first thought, one eould jump to the eonelusion that since the polynomial 
f is invariant under the group action, we could further restrict the polynomials Pi in the 
representation (1) to being invariant, i.e., those in the trivial isotypic component. But this 
is not true, as the following simple example illustrates. Consider the polynomial: 

f{x) := x‘^ + {x — + 2x^, 

invariant under the C 2 action generated by x ^ —x. The invariant ring is generated by 
9{x) = x"^, so the term x® cannot occur as the leading term of a sum of squares of invariant 
polynomials since 9“^ = x^^ and {9‘^fi = x^. <0 

Instead of restricting to invariant polynomials we need to deal with the full polynomial ring. 
Analog to the isotypic decomposition of R" in Section 4 the polynomial ring also has an 
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isotypic decomposition 


K[x] — Vi © V2 © • • • © V/i, Vi — Vii © • • • © Virn- (13) 

Here the isotypic component Vi with respect to the trivial irreducible representation is the 
invariant ring M[x]'^. The elements of the other isotypic components are called semi-invariants 

[30]. 

In order to hnd a symmetry adapted basis of an isotypic component of R[x] corresponding to a 
real irreducible representation dj of dimension n* > 2 one would like to compute equivariants. 

A polynomial mapping f : ^ M”* with /j G M[x] and 

/(d(^)x) = di(^)/(x), ygeG, 


is called ^-di-equivariant. 

Then the components form a basis which transforms with respect to dj, i.e. 

they form a symmetry adapted basis with respect to dj. Thus /i,... G Vi and /i G 
Vii,..., fm G Vim- There is a comprehensive body of theory that describes the invariants 
and equivariants [30,10]. 

The following lemma is the key of our developments: 

Lemma 5.2 ([30,10]) The set of 'd-'di-equivariant forms is a finitely-generated module over 
the invariant ring M[x]'^. 

By this lemma each Vij is a hnitely generated module over R[x]'^. The algorithm for the 
computation of a symmetry adapted basis of R” specializes to the computation of invariants 
and equivariants by analogue projections as outlined in [30,9,35]. 

There is much more known about the algebraic structures of the isotypic components V) of 
the polynomial ring which also helps to hnd generators. We will deal with this in Section 6. 
Within this section we just mention that the dimensions of the corresponding vector spaces 
(and therefore, the reduced SDPs) can be easily computed. 

The natural grading of R[x] induces a grading on the V). Each Vi decomposes into vector 
spaces of homogeneous polynomials. 


00 

V. = ©(r.)i 

k=0 

The formal sum 

00 

'/'.({) :=i:dim((V;0©'= 

fc =0 
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in the unknown ^ is the well-known Hilbert-Poincare series. For the invariant and semi- 
invariants it is usually called the Molien series. It can be computed as: 


1 ^ trace 


(14) 


Software is available for the computation of these series and the generators of the invariant 
ring and the module of equivariants. While the Molien series in available in Singular, Magma 
and Macaulay 2 [13] the more specialized Hilbert-Poincare series for semi-invariants can be 
computed with special Maple packages [11,16] and in Magma. That means that for the 
isotypic components (Id)^ the dimension and a vector space basis can be easily computed. 


Recall from Section 2 that we are interested in hnding for a given d-invariant polynomial / 
a positive dehnite matrix A G such that 

/(x) = AY, 


where we took A = 0 without loss of generality. The representation d induces the orthogo¬ 
nal representation p on R[x]<d and thus a on by (7). By Theorem 4.1 there exists an 
orthogonal change of coordinates T such that 

/(x) = Y'^TT^ATT'^Y = {T^Yf{T^AT){T'^Y), 

where T^Y = (wi, ... ,Vh) with Vi = {vn,... ,Virn) with Vij G (M[x])™b Each component of 
Vij is an element of Vij in (13). Again Schur’s lemma gives 


T^AT = 

Ai 

0 


Qi 

0 


0 



0 

Qi 


The representation of the SDP cost function F{X) = {En, X) = /(O) — A is not affected by 
this change of coordinates. 

If / can be represented as a sum of squares, then there exists a representation of the form: 

= QiViji^). (15) 

i=lj=l 

Notice that in the expression above, each of the h terms correspond to a different irreducible 
representation of the group G. Of course, just like in the non-symmetric case, the represen¬ 
tation is in general not unique. 

Theorem 5.3 Let f be a'd-invariant polynomial. Then, f{x) — can be expressed as a 
sum of squares of invariant and semi-invariant polynomials. 
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^2 

^3 

^5 


d 


1 

1 

-1 
-1 
0 -1 
1 0 


1 

-1 

1 

-1 
0 1 
1 0 


Table 1 

Irreducible unitary representations of the dihedral group -D 4 . 

Example 5.4 In this example we analyze a variation of the Robinson form, deseribed in 
[26], This instanee has an interesting dihedral symmetry, and it is given by: 


4„.2 


f{x,y) = x^ + y^ -X y 


4 2 

y X 


X 


y 


x^ — y"^ + ?>x^y‘^ + 1 . 


(16) 


This polynomial is a dehomogenization of Robinson’s form, i.e., f{x,y) = R{x,y,l). It is 
invariant with respect to a representation of the dihedral group of order 8, with the actions 
on generated by: 


il{d){x, y) = {-y, x), il{s){x, y) = {y, x). 

The generators d, s have period 4 and 2, respectively. They satisfy the commutation relation¬ 
ship s = dsd. The group has five irreducible representations, shown in Table 1, of degrees 
1,1,1,1, and 2 [27,7], All of them are absolutely irreducible. 

A symmetry-adapted basis for the five isotypic components can now be obtained, using the 
algorithm described in [7], obtaining the corresponding basis vectors: 


vi = {l,x^-Iy‘^) V51 = {x,x^,xy^) 

V2 = % V52 = iy,y^,yx^). 

V3 = (xy) 

V4. = {x‘^ - y"^) 


(17) 


The corresponding Molien series for each irreducible component all have the common de¬ 
nominator q{f) = (1 — ^^)(1 — and are given by: 

= = V.3(?) = |y. «« = !). = (18) 

Notice that from these, by expanding the series, we can directly read the dimensions of the 
bases (17) (and therefore the sizes of the corresponding SDRs), but much more important, 
we can do this for any other polynomial with the same symmetry group. As expected, ni ■ 
'’Pii.O = ( 7 ^ 1)2 1 Hilbert series of the ring of polynomials in two variables. 
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As suggested by Theorem 4-1, after this symmetry reduetion, the resulting semidefinite pro¬ 
grams are much simpler. The affine subspaee C is given by all matrices Q G which have 
block diagonal form with blocks Qi, Q3, Q4, Q5, Q5 restricted to be 

— 1 — 2ci C3 C4 

C3 1 C5 , 

C4 C5 —1 — 2c5 
Q3 = 1 - 4 c 2 - 4 c 3 - 4 c 4 , Qi = -I - C 2 - 2 c 3 , 

where 04,02,03,04,05 are real numbers. Then the cost function 1 — \ is minimized over the 
set of feasible matrices C fl 5^°. That means each block Qi is restricted to be a positive 
semidefinite matrix. 

The optimal solution has \* = —1|||, with the corresponding matrices being 


Qi — 


1 — A oi 

Ol 02 


Qb — 


Qi — 


with Cholesky factors 


1 

'25 55' 

7921 89 


64 8 8 

4096 128 




= LiL ^, Q3 = Q4 = 0, Qb = 

_5 1 

8 

89 1 



128 4 _ 


5-11 

L 8 J 


= l,lI, 


( 19 ) 


Li = 


The previous theorem then yields that the sum of sguares is 

89 1,0 / 5 


0 

89 


0 0-1 

64 

, Lb — 

0 0 1 


0 

1 




2 


1 

0 

0 

1 

1_ 


/(x,?/) - A* = +-(a;2+ 1/2)^ + + + (^--y+ y^ - x^y^ . 

Notice that the last two terms are not invariant under the group action. However, their sum 
is, and this property can be fully exploited, as we shall see in Section 6. <0 


Example 5.5 Consider the biguadratic form in six variables: 


B(x; y) = xjyf + xjyl + xjyj - 2 {xiX 2 yiy 2 + X2X3y2y3 + X 3 Xiy 3 yi) + 

+ {xlyl + xlyl + xlyl). ( 20 ) 

which is a minor modification of the celebrated example by Choi [1]. Choi’s original example 
was -B(x, y) + {x‘ly 2 +xlyl-\-xlyf), this modified version was studied by Choi and Lam [2]. This 
form has several interesting properties, such as being nonnegative, even though it cannot be 
written as a sum of sguares of polynomials. The associated matrix map B : ^ provided 

one of the first examples of a non-decomposable positive linear map. 
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The form B has numerous symmetries. The assoeiated group has 96 elements, with the 
corresponding generators given by: 


{xi,X 2 , X 3 , yi,y 2 , ys) ^ {-xi,X 2 , X 3 , -yi,y 2 , Vs) 

^ixi,-X2,X3,yi,-y2,y3) 

^ixi,X2,-X3,yi,y2,-y3) 

^ixi,X2,X3,-yi,-y2,-y3) 

^ix3,xi,x2,y3,yi,y2) 

^iy3,y2,yi,x3,x2,xi). 


The group has 14 irreducible representations: four one-dimensional, five of dimension two, 
four of dimension three, and one of dimension sia; (4 ■ + 5 ■ 2^ + 4 ■ 3^ + 1 ■ 6^ = 96j. The 

irreducible representations can be easily obtained using GAP4 [8], 


As mentioned, Choi showed that i?(x;y) is nonnegative, but cannot be written as a sum of 
sguares. Consider now the problem of checking whether /(x;y) ;= (J2ixj + yf)B{'x;y) is 
a sum of sguares. Clearly, all the symmetries of B are inherited by f, since the additional 
factor is invariant under unitary transformations. 


We apply next the symmetry reduction procedure introduced in Section 5. After solving the 
SDPs we find a SOS decomposition where only two of the fourteen irreducible representations 
appear: a two-dimensional representation with multiplicity mi = 3 and a 6-dimensional 
representation with multiplicity m 2 = 4. Indeed, f can be written as: 


2 6 

/(x; y) = J 2 X 2 jQ 2 V 2 j, 

j=i i=i 


( 21 ) 


where 


= {X3yiy2,x2yiy3,xiy2y3), 

V21 = ix 2 y 2 y 3 ,X 3 yi‘^,xiyiy 3 ,X 3 y 3 ‘^), 
V23 = {xiX 2 y 2 ,X 3 ‘^yi,XlX 3 y 3 ,Xl‘^yi), 
V25 = {x 2 X 3 y 3 ,xi^y 2 ,xiX 2 yi,X 2 ^y 2 ), 


Vi2 = {x 2 X 3 yi,XiX 3 y 2 ,XiX 2 y 3 ), 

V22 = {,xiyiy2,X2y3^,X3y2y3W2y2^), 
V24 = {xiX 3 yi,X 2 ^y 3 ,X 2 X 3 y 2 ,X 3 ^y 3 ), 
V 26 = ix 3 yiy 3 ,xiy 2 ^,X 2 yiy 2 ,xiyi‘^), 
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and Qi, Q 2 are the positive semidefinite matriees 


Qi — 


2 


.1 1 _i 

2 2 


\ 2 2 ^ J 


/ 1 _i 1 _l\ 
2 2 


Q2 — 


-1 1 


1 1 
'2 2 


1 -1 1 -1 

2 2 


1 1 _1 1 


V 2 2 


( 22 ) 


❖ 


6 Computing with invariants 


The block-diagonalization techniques described in the previous section directly enable a 
reduction in the computational burden of checking SOS-based conditions. However, for ef- 
hciency reasons, in some cases it is desirable to employ an alternative procedure where all 
computations are done using only invariant elements, and if possible never deal at all with 
the original variables. 

The main benefit of computing only in the invariant ring is a much smaller number of 
algebraic operations, besides the fact that in many symmetric problems the original data is 
already expressed in a compact form, in terms of invariants (see Example 8 . 2 ). Additionally, 
the resulting formulation has interesting theoretical properties, that shed more light on the 
nature of the sum of squares techniques. 

It is well-known that the invariant ring R[x]'^ is hnitely generated as a R-algebra. Denoting 
a set of homogeneous generators by (71 (x ),... ,gs{'x) this fact can be written as 

R[x]^ = R[^i,...,^,]. 

Consequently, for each invariant polynomial r(x) there exists a polynomial r{yi,... ,ys) such 
that 

r(x) = r(^^i(x), ... ,^s(x)). 

Given the polynomial generators gi the algorithmic computation of an appropriate r is a 
standard task in computational algebra, for which several techniques are available. 

Grobner bases: Besides special techniques for distinguished algebra bases, the general Grobner 
bases approach may be used in this context. In [31] a method is explained based on slack 
variables yi,... ,ys and a Grobner basis of {yi — (71 (x ),... ,ys — ( 7 s(x)) with respect to an 
elimination order. The division algorithm applied to r gives a representation r. 

There are two special sets of generators having advantageous properties. 


19 



SAGBI bases: If the invariant ring admits a finite SAGBI basis [32, Chapter 11], r is deter¬ 
mined by the subduction algorithm. Since this technique does not use slack variables it is 
more efficient than the Grobner basis approach. A nice example is the case of the symmetric 
group, where the method is equivalent to the well-known algorithm for expressing an invari¬ 
ant polynomial in terms of the elementary symmetric functions. However, this variant is not 
always possible, as there exist groups whose invariant rings have no hnite SAGBI basis, for 
any term ordering. We will not follow this approach here. 

Hironaka decomposition: It can be shown (see [31] and the references therein), that the 
invariant ring is Cohen-Macaulay, and has a Hironaka decomposition: 

= 0hi(x)M[0i(x),...,0„(x)], 
j=i 

with 6 *j(x),pj(x) being primary and secondary invariants, respectively. In particular the 
primary invariants are algebraically independent. Every invariant polynomial can be written 
uniquely in terms of Oi,r]j with pj appearing linearly only. Algorithms for the computation 
of primary and secondary invariants are presented in [31,10,6]. This unique representation 
may be found by using Grobner bases with respect to special term orders or by involutive 
bases [5]. 

In this paper we use the Hironaka decomposition (i.e., primary and secondary invariants) as 
the computational representation for the invariant ring. The main reasons are the advantages 
of having a unique representation, the available results on the structure on the isotypic 
components (e.g.. Theorem 6.1 below), and the simplihcations that occur in important special 
cases. 

For instance, in practice very often the group representations are so-called reflection groups. 
This means that ■d(G) is generated by reflections with respect to hyperplanes. In this situation 
the invariant ring is isomorphic to a polynomial ring (no secondary invariants are needed) 
and the other isotypic components are free modules over the invariant ring [31,30]. Our 
results are particularly appealing in this situation. 

Example 5.4 continued: We revisit Example 5.4, from this algebraic perspective. The 
action of D 4 ^ is a reflection group and thus the invariant ring S = M[a;, y]^^ is generated by 
two primary invariants: 


9i{x,y) = + y^, 92 {x,y) = x^y^. 

No secondary invariants are needed, therefore the invariant ring is isomorphic to a polynomial 
ring. Rewriting f{x,y) in terms of the invariants using any of the methods described, we 
obtain f = R[ 6 *i, 6 * 2 ] with: 

f{x,y) = f{ 9 i{x,y), 92 {x,y)), f{ 9 i, 92 ) = 9 l- 9 l- 4 : 9 i 92 - 9 i + 592 + l. (23) 

❖ 
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A difficulty in trying to directly apply invariant-based techniques to sum of squares problems 
is that in a SOS decomposition the individual components (the squares) are not necessarily 
invariant, e.g. Example 5.1. Nevertheless, the different terms corresponding to the same 
isotypic component can be conveniently “bundled” together, as we show next. 

The representation (15) may be written in a different way. Since z^Qz = {Q , zz"’"), then for 
each i: 

rii rp j rii rp\ 

j=l \ j=l / 

using Pi(x) = 'i'ii(x)(uij(x))^ G (M[x])”^*^”^\ Since the decomposition is symmetry- 

adapted, the entries ph = jyjLi{vij)k{vij)i of the matrices Pi = {j>\i)ki are scalar products 
of equivariant polynomial vectors. Therefore, because the representations are assumed to be 
orthogonal, the phipi.) are invariant polynomials, obtaining the representation: 

/(x) = y (Q., P,{x)), P, € (Klxl^r'x™. 

i=l 

The matrices Pi have components which are invariant polynomials, and therefore can be 
expressed in terms of primary and secondary invariants 9i,pi. 

Since both / and the entries of Pi are invariant, there are unique representations f{9,p) = 
Yfj=iVjfj{9) and pli{9,p) being linear in p such that 

/(x) = /(0(x),77(x)) = ^r/j(x)/j(0i(x),...,0„(x)), phi^) = pli(e{^),p{^)y 

j=i 

So the sum of squares of an invariant polynomial corresponds to a representation 

i=l j=l 

where the matrices Qi are positive semidefinite. 

In order to compute in the coordinates Oi,pj we are left with the task to determine the 
matrices Pi. To provide an efficient representation of these matrices, we will exploit the 
algebraic structure of the module of equivariants. 

Theorem 6.1 ([35]) For each irreducible representation hii the module Mi of 3}-3}i-equivariants 
is a free module over the ring in the primary invariants: 

where b] G R[x]'^* denote the elements in the module basis and ri is the rank of the module. 

For some special cases the ranks r* are known a priori. For instance, in [30, Proposition 
4.7] it is stated that for groups generated by pseudo-reflections, the module of equivariants 
associated with one-dimensional irreducible representations have r* = 1. 
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In the case of a one-dimensional irreducible representation (n* = 1) we have Mi = Vi. In 
general the spaces Vij are generated by the components of the equivariants: 

We choose the special vector space basis ni(x) determined by the module basis, that is 

Vil 

\ (24) 

Vim 

Example 5.4 continued: As stated before has 5 irredueible representations of dimen¬ 
sions 1 and 2. For the one-dimensional representations the modules are generated by one 
element: r 2 = r^ = r 4 ^ = 1. The generators are xy{x^ — y"^) and xy and x"^ — y"^, respeetively. 
The fifth irredueible representation has dimension 2; the free module has two generators, 
namely 

( x^\ 

b2{x,y) = 

To make explieit this module strueture, we replace the basis elements ^ 51,^52 in (17) by the 
eguivalent ones: 

V 51 = (x, x^, (x^ + y^)x) = (x, 9ix ), V 52 = (y, |/^ + y^)y) = (y, |/^ diy) ■ 

<> 

Now we show how the information in the matrix Pi can be “compressed” to a much smaller 
one, of size equal to the module rank r*. 

Let Ilj be the x r* principal subblock of Pi dehned by the first rows and columns. Without 
loss of generality we can assume that the size of the matrix Pi is a multiple of r* (otherwise 
we add rows and columns of zeros to Qi). By the choice of vector basis in (24), the other 
r* X rj-subblocks of Pi are just monomial multiples of 11*. The monomials are just monomials 
in 6 only. Similarly, we split Qi into r* x rj-subblocks. We can then rewrite the inner products 
as operations between the polynomial matrices 11* and SOS matrices S'*, dehned in terms of 
Qi- We formally state our result as follows: 

Theorem 6.2 Let /(x) be a SOS polynomial, that is invariant under the action of a finite 
group G, and 9i,rii be primary and secondary invariants of the corresponding invariant ring. 
Then, f{9,rj) has a representation of the form: 

/(«.>)) = i;(S.(9),n,(9,,,)), (25) 
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where Ilj G R[ 6 *, 77 ]’’*^’’* depend only on the group aetion, and Si G are SOS matriees. 


Proof: For notational simplicity, in this proof we drop the index i denoting the i-th irre¬ 
ducible representation. For each real irreducible representation the matrix 11 G is 

dehned in the following way. The entry 7iki{d,v) is the invariant polynomial (6fc(x))^6i(x) 
represented in terms of fundamental invariants. We represent the basis v as in (24), where 
w = {wi,... is the vector of monomials in 6 . Using a weighted degree on ^[6^] and T 

induced by the degrees of di{'x.),rij{x.) it is clear that we can take w to contain all monomials 
in 9 of degree less than or equal to i(deg(/) — min^ degTr^fe). In particular, we take Wi = 1. 

As mentioned, in the chosen basis the matrix P is formed by blocks which are scalar copies 
of n, having the structure P = ww'^ ® IT. As a consequence, we have: 


{Q,p) = {Q, ww'^ ® n) = (Q, n 0 ww'^) = {Q, {ir ® tc) (n 01)(A (g) w)'^) 

= {{Ir ® wfQ{Ir (E)w),U) = {S, n). 

where Q is a permutation of the rows and columns of Q, and S is being dehned by the last 
equality. It is clear that S G is a SOS matrix as in Dehnition 2.2, since it satishes 

y'^Sy = {y <S) w)'^Q{y ^ w), and Q is positive semidehnite since Q is. An alternative formula 
for S can easily be obtained. Partition Q = into n x z/-subblocks. This yields 

{S)ji = w’^QSw for j = 1,..., r, / = 1,..., r. □ 

When the numerical value of the invariants 6 , rj correspond to real values of the original 
variables, then the matrices nj( 0 ,? 7 ) will always be positive semidehnite, since they are 
constructed as sums of outer products (notice that they are not PSD for arbitrary values of 
9, f]). The Ilj so generated therefore provide an “easily parameterizable” family of nonnegative 
polynomials, constructed by contracting the matrices with arbitrary SOS matrices S'*. Since 
inner products of positive semidehnite matrices are always nonnegative, the representation 
(25) provides a “self-evident” certihcate of the nonnegativity of /. 

Remark 6.3 The image ofW^ under the invariants (the orbit spacej is a semialgebraie set. 
Nonnegative symmetrie polynomials naturally correspond to polynomials that are nonnegative 
over this set; see [25], The 11* are related to the defining inegualities for this set, as well as 
its matching stratifications (semialgebraie and by orbit-type). The representation obtained 
closely matches what would be obtained by using the Positivstellensatz, an issue that will be 
pursued elsewhere. 

There are a few practical details that help in the computation of symmetric SOS represen¬ 
tations as presented above. For instance, since we know that the degree of the (A;A;)-entry 
of Si is bounded in weighted degree by deg(/) — deg(7r^^), it is convenient to use diherent 
monomial sets for the entries of S', rather than a common set w as described in the preceding 
proof. This is equivalent to eliminating a priori the zero rows and columns in Q. Also, using 
the unique representation property of the Hironaka decomposition, the required identity (25) 
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can be expressed concisely as: 


/,(9) = y(s.(9),n;(9)), j = 

i=l 

where 11^(6', rj) = J2j H- (0) rjj. 

Example 5.4 continued: From the isotypic components, and using the new basis, we di¬ 
rectly compute the matrices 111 , 112 , 113,114 G corresponding to the one-dimensional 

representations: 

ni = l, Il2 = 92[9l-A92), Ha = 02, n4 = 0? - 402- 
For the fifth component we have with = (1, 0i) the extended basis: 


V 51 = (x, x^, 9ix, 9ix^^ = (1, 0i) ® {x, x^) 
V 52 = {y, l/^ 9iy, 0i|/^) = (1, 0i) 0 {y, y^), 

yielding 
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where the ordering in the latter eorresponds to the basis 


V 51 = (x, 6 ix,x^, 6 ix^'^ = (x,x^) 0 

V52 = (y,^ly,y^dly^) = (y,y^) 0 


We have the identity: 


101 

\ 

/a 

1 

_1 

0llb 

— ( Q^, IIs ® 

01 02_ 

/ 

\ 



{Qbi P 5 ) — ( Qb, 

\ [Oi 

= {{I 2 ® Q^{l 2 0 W 5 ), IIs) = (S' 5 , IIs). 

where the explieit expression for S '5 is given by: 


Sb 


wjQhwb wjQu'f^b 
wlQl^w^ 


The construction guarantees S '5 is a SOS polynomial matrix, yielding the final decomposition: 


f{ 0 i, 02 )-X* 


+ ^ n,+ 

64 2 / 


(01 + 1)2 -f -201 
-1-201 4 



❖ 

The next example illustrates the results in a case where secondary invariants are involved. 
Example 6.4 Consider the cyclic group C 4 in four elements acting on as: 

{x,y) ^ {-y,x). 

The corresponding invariant ring M[a;, y]*""* is not isomorphic to a polynomial ring in two 
variables. Instead, it is generated by primary and secondary invariants given by: 

ei=x^+y‘^, e2=x^y^, Vi = l, r]2 = xy{x^ - y'^), 

that satisfy the relation (or syzygyj rj^ + 40| — 0^02 = 0. 

The group C 4 has four one-dimensional complex irreducible representations (4 ■ = \Cfi\). 

Specializing to the field of real numbers, two of these ones merge, so the dimensions over M 
are 1 , 1 , and 2 . 

The isotypic components ofM.[x,y] are: 
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V,=ner,e2]-{l,xy{x^-y^)} 

V^2 = IR[6'i,6'2] ■ {xy.x"^ -y'^] 

y’);' 

The corresponding matrices 11* take the form: 


Hi = 

1 r]2 

, n2 = 

1 

1 _ 

, n3 = 

01 0\ - 202 


V2 0\02 - ^0\ 


r]2 0\ - 4:02 


0f - 202 0f - 30102 


Therefore, by Theorem 6.2, every C^-invariant SOS polynomial has a representation of the 
form 

/(^ i ; ^ 2 , V2) = 62), nj ( 6 * i , 02,62)), 

i=l 

where the Si are sum of squares matrices. <0 

Remark 6.5 It is interesting to notice that some of the matrices 11* have simple interpreta¬ 
tions. Consider the case of the symmetric group Sn. It is well-known that the lowest dimension 
where the alternating representation appears non-trivially is equal to . In that case, the 
corresponding 11 is a scalar polynomial of degree n{n — 1), and equal to the discriminant 
I\i<j{xi — XjY of the polynomial IVf^i{z — Xi). In fact, explicit formulas for the II correspond¬ 
ing to one-dimensional representations of reflective groups can be obtained explicitly; see [30, 
Proposition 4-7]■ 




X 


7 The algorithm 


In the following we summarize the steps which are needed in order to efficiently compute a 
sum of squares representation of an invariant polynomial. Since several steps of the computa¬ 
tion are useful for all polynomials with a given invariance group, we distinguish between the 
operations that depend only on the specihc group (which can be thought of as a preprocessing 
step), and those that operate on the concrete instance. 

Algorithm I (Compute SOS generators) 

Input: linear representation d of a hnite group G on 

1. determine all real irreducible representations of G. 

2. compute primary and secondary invariants, 6i,rjj. 

3. for each non-trivial irreducible representation compute the basis b\,..., 6*^ 
of the module of equivariants. 
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4. for each irreducible representation i compute the corresponding matrix Ili{d,r]). 
Output: Primary and secondary invariants 6 ,r]. Matrices n*. 

Algorithm II (Sum of squares for invariant polynomials) 

Input: Primary and secondary invariants 6 ,r]. Matrices Ilj and / G M[x]'^. 

1. rewrite / (if needed) in fundamental invariants giving f{d,r]). 

2. for each irr. repr. determine Wi{6) and thus the structure of the matrices Si G M[6']. 

3. hnd a feasible solution of the semidehnite program corresponding to 

the constraints {Si are SOS matrices, / = flj)}. 

Output: SOS matrices Si providing a generalized sum of squares decomposition of /. 

In the following example the single steps of the algorithm are illustrated. 

Example 7.1 We consider the symmetric group S 3 operating by permutation of the vari¬ 
ables {x,y,z}. The group D 3 = S 3 has order six, with three irreducible representations of 
dimensions 1,1, and 2 (verifying 1^ + 1^ + 2‘^ = Q), shown in Table 2. All three are absolutely 
irreducible. 

The invariant ring M[a;, y, is the polynomial ring generated by the elementary symmetric 
polynomials ei = x -\- y -\- z,e 2 = xy + yz + xz, 63 = xyz, which constitute primary invariants 
(and moreover, a SAGBI basis). 

The modules are generated by 


bi = ix-y){x-z){y-z), b) = 


3 I {‘2x-y-z)/V2 


{y-z)V^/V2 


- 
O2 — 


/ \ 
{2yz — zx — xy) / \/2 

^ {zx - xy)\/^l\/2 


Computing the matrices Hi corresponding to each irreducible representation, we obtain: 


III = 1 , 112 = 6162-462-46163 + 18616263-2763, 


n3 = 


26 i — 662 —6162 + 963 

-6162 + 963 262 - 66163 


This completes Algorithm I collecting the results which depend on the group action only. 

Now we apply Algorithm II to the global minimization of the specific guartic trivariate poly¬ 
nomial presented in the Introduction in Example 2.4, and previously analyzed in 


f{x, y,z) = x'^ + y* + z‘^ — A xyz x + y + z. 
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Table 2 

Irreducible orthogonal representations of the symmetric group S 3 . The values oi a, P are a = ^, P = 
Notice that = 1, — P"^ = —a. 

As shown in [22], without using symmetry the associated semidefinite program has dimensions 
10 X 10, and the corresponding subspace has dimension equal to 20 (codimension is 35). 

Following Algorithm II the S^-invariant polynomial f is rewritten as 

/(e) = e)- Aeje 2 + 2 e 2 + dciCs - Aes + ei. 

The polynomial f has weighted degree 4. Since Hi has degree zero, the corresponding Si 
should have degree four. We collect all monomials of weighted degree 2 in the vector Wiie) = 
( 1 , 61 , 62 , 6 ^). The matrix W has weighted degree 6 and thus does not contribute to the SOS 
representation of f. The entries of II 3 have degrees 2,3,4 resulting in ^3(0) = (l,ei). But 
the last column and row of and Q‘f‘ are zero since the weighted degree of 71^2 O four. 

Therefore, the original 10 x 10 semidefinite program is now reduced to two smaller coupled 
SDPs, of dimensions 4x4 and 3x3. The number of variables decreased from twenty to five. 

Solving the coupled SDPs numerically, we obtain the value = —2.112913882, that coin¬ 
cides (in floating point arithmetic) with the algebraic number of degree 6 that is the optimal 
minimizer f* (see [22] for its minimum polynomial). For easy verification, we present here a 
rational approximate solution to the optimal value, obtained by rounding. This has been done 
in such a way to preserve the sum of squares property, thus providing a valid lower bound on 
the optimal value. For this, consider: 


Si = 


2113 

1000 


ei 

62 


e? 


^3 = 


79 


+ 6l + ||62 

79 

141 

2 1120..^ 

®1 11511^1^2 

T 






2113 

1 

79 

233 


1000 

2 

94 

496 


1 

13261 

560 

74 


2 

34968 

11511 

1279 


79 

560 

1439 

85469 


94 

11511 

2454 

377916 


233 

74 

85469 

85 


. 496 

1279 

377916 

693 

+ 

1279 1 ^ 

304 2 
693^1 

2 749 

9 1636 

ei 


148 3 I 

1279 1 ^ 


1439 2 _ 

2454*^2 


85469 2„ I 85 4 

188958^1^2 T" 693^1 



1 


ei 


62 


.e?. 


+ ^e^ 

9 ^ 1636 1 


3469 

4908 



It is immediate to eheek that Si, S 3 are indeed SOS, and that they satisfy f + = Si + 

therefore serving as a valid algebraie eertificate for the lower bound —2.113. <) 

In several of our examples, some irreducible representations do not appear in the final SDP 
(the corresponding basis is empty). This happens only because the polynomials are of rel¬ 
atively low degree. It can be shown, using the Molien series, that all the irreducible repre¬ 
sentations appear non-trivially in the induced representation on monomials of total degree 
d, for d sufficiently large (see [30]). 


8 Computational savings 


The exact amount of computational resources that can be spared at the SDP solving stage 
can only be quantihed precisely after dehning the specific symmetry group and the way it 
acts on M”. In what follows, we explicitly evaluate the computational savings obtained by 
exploiting symmetry, for two specihc group families. 

As mentioned in [22], the size of the semidefinite programs, when no symmetry is exploited, 
is equal to 

n + d\ fn + 2 d 

d j’ [ 2 d 

where N is the dimension of the matrix and m is the codimension of the subspace C. 

The first example will correspond to polynomials with even degrees, invariant under a natural 
action of . Secondly, we analyze the case of dense polynomials in n variables, of total degree 
2d, that are symmetric under the full permutation group Sn- 



8 .1 Even forms 


In this section we analyze a class of homogeneous polynomials in n variables arising from 
the study of copositive matrices, see [20]. These polynomials have the symmetry property: 

f{xi, ...,Xn) = f{±Xi, . . . , ±Xn), (26) 

for every possible combination of the signs. The associated group is the direct product of 
n copies of C 2 , and its order is equal to 2”. This group has 2” one-dimensional irreducible 
representations, and a complete set of fundamental invariants is given by di = x\,On = 

Each irreducible representation corresponds to the tensor product of n copies of the two 
representations of C 2 (namely, the trivial and the alternating), see Table 3. Let the type of 
a representation of (C' 2 )"' be the number of factors equal to the alternating representation. 
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id 

s 

■di (trivial) 

1 

1 

i32 (alternating) 

1 

-1 


Table 3 

Irreducible unitary representations of the group C 2 - 


Then, the Molien series for all 2"^ isotypic components can be simultaneously obtained from 
the identity: 


^ . t (;) ^ ^ £ (" " : - y- 

where ipr is the series corresponding to each of the irreducible representations of type r. 
For instance, for the invariant ring (i.e., the isotypic component corresponding to the trivial 
irreducible representation) the associated series is: 


£’o(0 


1 ^ +d—1\ 2d 

(i-er d y 


The 2" matrices Ilj in this case are scalars, and given by squarefree products of the 6 *,. 
Specializing the formulas above to the cubic terms (i.e., only the ones in ,^^), we see that: 

• There are n components of type 1, each with n monomials of the form {x^xi ,..., x^Xn}-, i = 

l,...,n. 

• There are 
3 < k. 

Therefore, the problem of checking a sum of squares condition for a sextic homogeneous 
polynomial satisfying the symmetry conditions (26) can be exactly reduced from an SDP of 
dimension to a coupled system of n smaller n x n SDPs with linear constraints 

(cf. [20, Theorem 5.2]). 

The savings can be even more dramatic when higher degree forms are considered. For in¬ 
stance, the standard approach for a homogeneous polynomial of degree 8 in n = 10 variables 
requires an SDP of dimension 715 x 715. The symmetry-aware methodology reduces this to 
coupled SDPs of dimensions 55 x 55 (1), 10 x 10 (45) and 1x1 (210), which are considerably 
easier to solve. The corresponding results for arbitrary order forms can be similarly obtained 
from the Molien series above. 


(^ 3 ) components of type 3, each with one monomial, of the form {xiXjXk},i < 
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Class 

Dimension 

Young Tableaux 

[4] 

1 

12 3 4 

[3,1] 

3 

1 3 4 

2 

12 4 12 3 

3 4 

[2,2] 

2 

13 12 

2 4 3 4 



1 2 

13 14 

[2,1,1] 

3 

3 

2 2 



4 

4 3 




1 

[1,1,1,1] 

1 


2 

3 




4 


Table 4 

Similarity classes and number of representations for the symmetric group 84 ^. 


8.2 Symmetric group Sn 


Here we analyze the case of polynomials in n variables, invariant under the symmetric group 
Sn acting by permutation of the indeterminates. As is well known, such S'^-invariant poly¬ 
nomials can be rewritten in terms of the elementary symmetric functions Cj. Since Sn is 
operating as reflection group the invariant ring is the polynomial ring M[ei,... ,6^]. More¬ 
over, the elementary symmetric polynomials form a SAGBI-basis. 

We briefly recall some classical results on the representation theory of the symmetric group Sn 
[14]. As with all finite groups, the number of mutually inequivalent irreducible representations 
is equal to the number of similarity classes. For Sn there is a bijection between similarity 
classes and additive partitions of n. Therefore, the number of irreducible representations is 
equal to the number of partitions of n and, by a celebrated result of Hardy and Ramanujan, 
asymptotic to 4 ^^^ The dimensions of the representations are given by the number 
of standard Young tableaux associated with a given partition. For instance, for 5*4 we can see 
in Table 4 that there are five irreducible representations, of dimensions 1,3,2,3,1, respectively. 


Example 8.1 A symmetric quadratic form in n variables has the representation /(ci, 62 ) = 
ae‘l + be 2 . We derive the conditions for f to be a sum of squares (or nonneqative, since in the 
quadratic case these notions coincide). By Theorem 6.2, and taking into account the degree 
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d = 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 


1 

1 

2 

3 

5 

6 

9 

11 

15 

18 

23 

27 

34 

39 

47 

54 

92 

0 

1 

2 

4 

6 

10 

14 

20 

26 

35 

44 

56 

68 

84 

100 

120 

93 

0 

0 

1 

1 

3 

4 

7 

9 

14 

17 

24 

29 

38 

45 

57 

66 

9a 

0 

0 

0 

1 

2 

4 

6 

10 

14 

20 

26 

35 

44 

56 

68 

84 

95 

0 

0 

0 

0 

0 

0 

1 

1 

2 

3 

5 

6 

9 

11 

15 

18 

Total 

1 

4 

10 

20 

35 

56 

84 

120 

165 

220 

286 

364 

455 

560 

680 

816 


Table 5 

Dimension of the isotypic components of ^4 acting on the monomials of total degree d, not counting 
multiplicities. The last row is the total dimension of the space, equal to 

f j if the form is SOS there exists a representation 

/(ci, 02 ) = Si ■ 1 + S 2 ■ ( 112)11 = (o^i) ■ 1 + C 2 ■ [(n — l)e^ — 277 , 62 ]. 

where Ci,C 2 are nonnegative eonstants. This gives rise to the SDP (aetually, just a linear 
program): 

a = Cl + (77 - 1) 62 , &= -27762, Ci,C2>0. 

Therefore, the necessary and sufficient conditions for f to be a sum of sguares are {2n)a + 
(77 — l)b > 0,b < 0 . 

In general, there seem to be no explicit formulas for the dimensions of the isotypic components 
of the polynomial ring (other than the Molien series), so either asymptotic results or special 
cases should be analyzed instead. In the example below, we focus on the symmetric group 
in four variables 5*4. 

Example 8.2 We consider two examples taken from a paper by Sottile [29] arising from a 
conjecture of Shapiro and Shapiro. These polynomials are invariant under the full permu¬ 
tation group in four variables. The group 5*4 has five irreducible representations, of degrees 
egual to 1,3,2,3 and 1. 

In Table 5 we present the dimensions of the corresponding isotypic components when the 
group acts on the space of monomials of degree d. The entries were computed using the 
Molien-Hilbert series. 

Consider for example the case of a guartic form (2d = appearing in [29, Example 2.2]: 

/(e) = 1662 — 486163 + 192c4, 

where the Cj are the elementary symmetric functions in {s,t,u,v). According to Table 5, 
the problem can be reduced to a simple SDP of dimensions 2,2 and 1. The corresponding 
symmetry-adapted vector basis is: 
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Vi = + v'^, st + su + sv + tu + tv + uv) 

V21 = — v‘^ — US — vt) 

V22 = (u'^ + v'^ — s‘^ — uv — st) 

V23 = sv — tu) 

V31 = {uv + st — sv — tu) 

1 

iFrom the invariant representation viewpoint introduced in Section 6, the polynomial de¬ 
scribed has a very simple form, as it corresponds exactly to a scalar multiple of the (1,1) 
entry of the matrix IIs. In other words, we have: 


I J- c>-/- J_ 



/(e) = 






the value of c depending on the specific normalization chosen for IIs. It is easy then to find 
the explicit decomposition in terms of the original variables: 

1 

73 


f{s, t, u, v) = 12(uv st — sv — tu^ + 12 ^ 



We also applied the methods to the degree 2d = 20 symmetric polynomial in [29, Theorem 
2.3], From the table, we can easily appreciate the benefits of symmetry exploitation: for this, 
instead of solving a single SDP of dimension 286 x 286, five coupled smaller ones, of di¬ 
mensions 44, 26, 24, 23, 5 are employed (even smaller SDPs can be achieved by exploiting the 
sparsity pattern). 

By appropriately choosing the objective function of the SDP, we guickly found a particular 
numerical solution with support in only two of the five isotypic components (Di and D 2 ), 
therefore verifying its nonnegativity. Even though Sottile was able to find an explicit set of 
polynomials with integer coefficients whose sguares add up to f, in the representation we 
found these do not seem to have “nice” rational expressions. While there is no reason to 
expect our methods to yield solutions with particular properties such as rationality, this is 
sometimes the case for extremal solutions. Nevertheless, verification of the solution can still 
be done independently, by approximating an interior solution with rational numbers as was 
done in Example 7.1. <0 
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